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(54) Using virtual URLs for load balancing 

* (57) A method, computer program product, arid 
server for use managing connection requests to a pool 
of servers identified by a given URL The method begins 
in response to a connection request from a given client 

. machine that initiates a user session for assorting a 
session identifier with a given server in the pool. The 
session identifier is then used to . generate , a Virtual" 
URL that redirects the connection request to the given 
server. Thereafter, any additional connection requests 
issued from the given client machine durfog the user 
session are redirected to the given server so that all 
content is served to the client from the same location. 
When tiie user session terminates, the virtual URL is 
inactivated and the given server fe returned to the pool 
so that it can then be assigned a new user- session to 
manage. 
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Description 



[0001] This invention rentes generally to iriforrha- 
tion retrieval in a computer network. More particularly 
the invention relates to a method and system for balanc- 
ing HTTP requests to a set of servers on a per-session 
as opposed to per-connection basis. 
I0002J The World Wide Web is the Internet's multi- 
media information retrieval system, in the Web environ- 
ment, client machines effect transactions to Web 
servers using the Hypertext Transfer Protocol (HTTP) 
which is a known application protocol providing users 
access to files (e,g„ text, graphics, images, sound 
video, etc.) using a standard page description language 
known as Hypertext Markup Language (HTML). HTML 
provides basic document .formatting and allows the 
developer to specify links" to other servers and files. In 
the Internet paradigm, a network path to a server is 
identified by a so-called Uniform Resource Locator 
(URL) having a special syntax for defining a network 
connection. Use of an HTML-conpatible browser (e.g 
Netscape Navigator or Microsoft internet Explorer) at a 
client machine involves specification of a link via the 
URL In response, the client makes a request to the 
server identified in the link and. in return, receives in 
return a document or other object formatted according 
to HTML A collection of documents supported on a 
Web server is sometimes referred to as a Web site. 
[0003] At many popular Web sites, the capacity 
demand is much greater than can be served by one 
server. Thus, H is known in the art to mirror a Web site 
and to incorporate a load balancing routine to distribute 
connection requests across a pool of servers. Typically 
such load balancing is earned out in a rbund-rbblh f^h' 
ion. Each server preferably includes the same dati sb 
any request can be handled by any of foe muftfcle serv- 
ers in the pool. This distributes the load in $n even man- 
ner.. . 

[0004] While such known load balancing schemes 
are advantageous, certain types of Web server transac- 
tions are not conducive to load balancing in this manner 
Thus, for example, assume that the Web site is a finan- 
cial institution or bank. Typically, a user accesses such 
a^etoeffectasetofoneorrrwetran^ctions < e.g an 
account balance inquiry, a transfer of funds between a 
Set of pair of accounts, and the like. When this type of 
site is mirrored for load balancing purposes, there is a 
likelihood that the user's HTTP requests will be serviced 
by different servers in the server pool. This is undesira- 
ble, and it may force the user to have to log-in repeat- 
edly to complete all of the transactions required. Similar 
problems are encountered in mirrored sites that offer 
electronic commerce transactions. 
[0005] This problem arises because prior art load 
balancing techniques operate oh a per-connection 
basis. Thus, if a given server handling an HTTP request 
suddenly becomes overloaded due to an excessive 
number of connections, the load balancing routine auto- 



matically delivers a next connection request to another 
server in the pool, even if the request originates from 
the same client machine. 

[Q006] The present invention accordingly provides 
5 m a first aspect, a method for managing connection 
requests to a pool of servers identified by a given URL, 
comprising the steps of: in response, to a connection 
request from a given client machine that initiates a ses- 
sion, associating a session identifier with a given server 
io in the pool; using the session identifier to redirect the 
connection request to the given server; and during the 
session, redirecting to the given server any additional 
connection requests from the given client machine. 
[0007] The step of using the session identifier pref- 
er eraWy . includes generating a virtual URL The virtual 
URL preferably comprises a URL in the connection 
request modified to include the session identifier. ' 
[0008] The session identifier is preferably incorpo- 
rated in data returned from the given server to the client 
so machine. 

[0069] The method according to the first aspect 
preferably further includes the step of: in response to a 
connection request from the given client machine that 
terminates the session, inactivating the session (denti- 
ns tier. 

[001 0] The given client machine preferably includes 
a browser. 

[0011] Each of the servers in the pool preferably 
supports a similar set of objects. 
so ? [0012] The session identifier is preferably associ- 
ated with a given server as a function of a load balanc- 
ing protocol, 

[0013] | n a second aspect, the present invention 
provides a method for managing connection requests to 
^ a pool of servers, corn^ 

connection requests from client rmchines . that initiate 
user sessions, associating each User session originat- 
ing from a client machine with a given Server in the pool 
in accordance with a load balancing protocol; and dur- 
40 In 9^ user session, redirecting to the given server any 
additional connection requests originating from the cli- 
ent machine. 

[0014] The associating step preferably Comprises- 
generating a virtual URL by modifying a given URL to 
45 include a session identifier; and using the virtual URL to 
redirect the connection request to the given server. 
[0015] a method according to the second aspect 
preferably further includes the step ot inactivating the 
virtual URL upon completion of the user session. 
so [0016] All data returned from given server to the cli- 
ent machine preferably includes the session identifier. 
[0017] Each of the servers in the pool preferably 
supports a similar set of given objects. 
[0018] Each client machine preferably includes a 
ss Web browser. 

[0019] In a third aspect, the present invention pro- 
vides a computer program product in a computer-read- 
able medium for managing connections requests to a 
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pool of servers, comprising: means responsive to con- 
section requests from client machines (hat initiate user 
sessions for associating each user session originating 
from a:ciierit machine with a given server in the pool in 
accordance with a load balancing protocol; and means 5 
operative during each user session for redirecting to the 
given server any additional connection requests origi- 
nating from the client machine. 
[00201 The associating means preferably com- 
prises: means for generating a virtual URL by modifying 10 
a given URL to include a session identifier; and means 
for redirecting a given connection request to the given 
server using the virtual URL 
[0021] A computer program product according to 
the third aspect preferably further includes means for 15 
inactivating the virtual URL upon conpletibn of the user 
session " . t 

[0022] In a fourth aspect, the present invention pro- 
vides a server for managing a pool of servers at a Web 
sftefcJentifiedbyagivenURLc^^ so 
an operating system; a load balancing routine; and a 
redirector routine for managing HTTP connection 
requests to the Web site, comprising: means responsive 
to connection requests from client machines that initiate 
user sessions for associating each user session brigi- 25 
riating from a client machine with a gfyen server in the 
pool in accordance wfth the load balancing routine; and 
means operative during each user session for redirect- 
ing to the given server, any additional connection 
requests originating from the cliient machine. 30 
[0023] The means for associating preferably com- 
prises: means for generating a virtual URL by modifying 
the given URL to include a s<&sibn Identifier; and 
means for redirecting a given connection request to the 
given server using the virtual URL & 
[0024] iri a. server according to, the fourth aspect 
the redirector preferably fwlher includes means for 
fnactivating the virtual URL upon completion of the user 
session 

[0025] In a fifth aspect, the present invention pro- 40 
vkJes a method of managing a poot of servers at a Web 
site identified by a given URL cornprising the steps of: 
responsive to connection -requests from client ^machines 
that initiate user sessions, associating each user ses- 
sion originating from a client machine with a server in 45 
the pool; distributing the user sessions across the pool 
bf servers according to a load balancing protocol; arid 
during a given user session initiated from a given client 
machine, serving content to the given client machine 
only from its associated served. & 
[0026] In a sixth aspect, the present inverttion pro- 
vides a method of managing a pool of servers at a Web 
site identified by a given URL comprising the steps of: 
during each user session initiated from a given client 
machine, temporarily redirecting an connection ss 
requests originating from the cfient machine to a given 
server in the pool; and distrtouting the user sessions 
across the pool of servers according to a load balancing 



protocol. 

[0027] It is preferred in this invention to provfcle a 
method for equitably distributing client requests across 
a set of servers on a per-session basis. Preferably, a 
given server in the set is allocated a given number of 
sessions, as opposed to a given number of connections, 
and thus a user's HTTP connection requests are prefer- 
ably service! from the same server in the set through- 
out the session. 

[0028] It is further preferred in this invention to 
implement a load balancing routine across a set of serv- 
ers wherein given connection requests originating from 
a client machine are serviced from the same server in 
the set 

[00&9] A still further preferred feature of the present 
invention is to manage HTTP connection requests from 
a client machine using a so-called virtual URL that 
defines a user session. The client is then notified that 
that the requested URL has been moved to a specific 
server. 

[0030] Yet another preferred feature of the present 
invention is to balance connection requests to a pool of 
servers on a session basis so that each server has a 
predefined number of users that may obtain access to 
documents supported on that server. 
[0031] It is still another preferred feature of this 
invention to redirect HTTP connection requests originat- 
ing from a user ot a given client machine to a specific 
server during a session 

[0032] These and other preferred features of the 
invention are provided in a method, corrputer program 
product and server .for. use managing connection 
requests to a pool of servers identified by a given URL 
In one embodiment, the method begins in response to a 
connection request from a given cfient machine that ini- 
tiates a user session in response to the request the 
method associates a septan identifier with a given 
server in the pool. The session fcJerttifieris then used to 
generate a •Virtual" URL that redirects the connection 
request to the given server. Thereafter, any additional 
connection requests issued from the given client 
machine during the user session are redirected to the 
given server so that all conterit is served to the client 
from the same location When the user session termi- 
nates, the virtual URL is inactivated and the given 
server is returned to the pool so that h can then be 
assigned a new user session to manage. 
[0033] Thus, according to preferred embodiments 
of the invention, a method for managing connection 
requests to a pool of servers is responsive to connec- 
tion requests from client machines that initiate user ses- 
sions for assaying each user session originating from 
a client machine with a given server in the pool; User 
sessions are associated with the servers in the pool in 
accordance with a loadbalancing protocol. During each 
user session any additional connection requests origi- 
nating from a given dient machine are then reelected 
to the server that is managing the session The user 
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s ^onisass^tedvvithagivenMrverbygefiGrtting 
a virtual URL that includes a unique session identifier 
which identifier is then provided with all data returned 
from the server to the client machine. The virtual URL is 
inactivated upon completion of the user session. s 
[0034j Preferred embodiments of the Invention also 
preferably comprise, a server for managing a pool of 
servers at a Web site identified by a given URL The 
server may comprise a processor, an operating system 
a load balancing routine, and a redirector routine for to 
managing HTTP connection requests to the Web site 
The redirector routine is a front end process that runs in 
the server and includes means responsive to connec- 
tion requests from client machines that initiate user ses- 
sions for associating each user session originating from is 
a client machine with a given server through a unique 
session identifier. During each user session, connection 
requests originating from the client machine* the given 
URL are redirected to the given server, 
[0035] A preferred emtocfiment of the present so 
invention wifl now be described by way of example with 
reference to the accompanying drawings, irt which- 



Figure 1 is a representative system in which a pre- 
ferred embodiment of the present invention is ss 
implemented; 

Figure 2 is a flowchart illustrating the conventional 
processing associated with an HTTP request from 
the Web client to the Web server shown in Figure 1; so 

Figure 3 is a block diagram of a client machine con- 
necting to a Web site comprising a server pool: 

Figure 4 is a state diagram fflustrating the inventive as 
functionality of the redirector routine; and 

Figure 5 is a flowchart illustrafing the function of the 
. redirector routine. 

[0036] A known Internet dienfcserver system is *° 
implemented is illustrated in Rgure 1. A client machine 
10 is connected to a Web server 12 via network 14. For 
illustrative purposes, network 14 is the Internet an 
intranet, an extranet or any other known network. Web as 
server 12 is one of a plurality of servers which are 
accessible by clients, one of which is illustrated by 
machine 10. A representative client machine includes a 
browser 16, which is a known software tool used to 
access the servers of the network. The Web server sup- so 
ports files (collectively referred to as a "Web" site) in the 
form of hypertext documents and Objects. In the Internet 
paradigm, a network path to a server is identified by a 
so-called Uniform Resource Locator (URL) 
[0037] A representative Web server 12 is an IBM ss 
Netfmrty server comprising a RISC>based processor 18 
the AIX operating system 20 and a Web server program 
22. such as Netscape Enterprise Server. The server 12 



also includes a display 24 supporting a graphical user 
interface (QUO for managementand administration, and 
an Application Programming Interface (API) 23 that pro- 
vides extensions to enable application developers to 
extend and/or customize the core functtonaiity thereof 
through software programs Including Common Gateway 
Interface (CGI) programs, plug-ihs. servlets. active 
server pages, server side include (SSI) functions or the 

IlKO. 

[0038] A representative Web client is a personal 
computer that is x86-. PowerPC- or RISC-based, that 
includes an operating system such as IBM OS/2 or 
Microsoft Windows -95. and that includes a Web 
browser, such as Netscape Navigator 4.0 (or higher) 
having a Java virtual Machine (JVM) and support for 
application plug-ins or helper applications. 
[0039] The Web server accepts a client request and 
returns a response. The operation of the server pro- 
gram 22 Is governed by a number of server application 
functions (SAFs). each of which is configured to execute 
in a certain step of a sequence. This sequence, illus- 
trated in Figure 2 by way of background only, begins 
with authorization translation (AuthTrans) 30. during 
which the server translates any authorization informa- 
tion sent by the client into a user and a group. If neces- 
sary, the AuthTrans step rnay decode a message to get 
ttie actual client request At step 32. called name trans- 
lation (NameTrans), the URL associated with the 
request may be kept intact or it can be translated into a 
system-dependent file name, a redirection URL or a 
rrtnpr site URL At step 34. called path checks (Path- 
Check), the server performs various tests On the result- 
ing path to ensure that the given client, may retrieve the 
document At step 36, sometimes referred to as object 
types (ObjectType). MIME (Mulfipurpo.se Internet Mail 
Extension) type information (e.g.. text^, jmage/gif 
etc.) for the given document is tteriWied. At step 38 
called Seryice (Service). the Web server routine setects' 
arnnternal server function to send the result back to the 
dient.This function can run the normal: server service 
routine (to return a fie), some other server function 
(such as a program to return a custom document) or a 
CGI program At step 40. called Add Log (AddLog) 
information about the transaction is reoordedl 
[0040] Figure 3 is a Mock diagram illustrating the 
environment in which the present invention is imple- 
mented. In this illustration, dient machine 42 makes 
HTTP requests to a Web site comprising a managing 
server 44. and a set of mirrored servers 46a*46n man- 
aged by the managing server. Managing server 44 may 
comprise one of the mirrored servers. Managing server 
44 includes a load bafendng. routine 48 for keeping 
track of the load on each of the mirrored servers and for 
directing service requests to the servers according to a 
load balancing akjorithm. According to the present 
invention, the managing server further indudes a redi- 
rector routine 50 that provides the enhanced functional- 
ity of the present invention . 
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[0041] As will be seen, redirector routine 50 acts as 
a front end process for parsing client requests and for 
determining whether a given request represents the 
beginning or end of a Session." As used herein, a ses- 
sion represents a set of connection-less transactions s 
between a given user (at a client machine) and the Web 
site. For example, if the managing server is a bank Web 
site, the session involves a set of queries to the server 
from the user, together with the responses served from 
the servers. In this example, a user logs in and is recog- 10 
need as being authorized to access given information 
(e.g.. a bank account balance). According to thepresent 
invention, the redirector enables all of requests originat- 
ing from the client machine to the site, and an of the 
information defivered to the client machine, to be served , 5 
to and from the same server during a given "session - 
One of ordinary skill in the art will thus appreciate that 
this enables the load to be balanced on a per-session 
basis, as opposed to a per-comeoion basis as in the 
priorart. 

20 

[0042] This operation is new described with respect 
to the state diagram of Figure 4, together with the flow- 
chart of Figure 5. Sequential steps are numbered in the 
state diagram. The routine begins at step 52 with the cli- 
ent making a requ^t to initiate a session. lypicaUy; this ss 
is a login request which may occur, for example; by 
having the user access the Web site (through its URL) 
and then enter infbrmatioh Into a CGMaased form. This 
Is a conventional login fransactiori over the Internet 
Returning to the flowchart, in step 52. the cfient ao 
nrachine thus issues an HTTP request (for example, 
nttoy/wwwiartfccornyioginiitrnl) to the managing 
server. At step 54. the redirector intercepts the request 
and recognizes that the user has requested theinitiation 
of a session. Slewing (cpfionaO.en^ „ 
a userid and password, the routine continues at step 56 
wherein the redirector queries the load balancing rou- 
tine to determine which of the servers in the set win 
service the session. This aitcorrie of this test depends 
on theparticular load balancing algorithm being imple- ao 
[nerted- As noted above, the present invention enables 
HTTP connection requests and associated server 

responses to be managing through one server of the set 
throughout the entire session; 

[0043] At step 58. the redirector associates a ses- <s 
sion identifier with the session and then notifies the 
server (in this example, server S4) that has been 
selected to manage the session of the session identifier 
Preferably, the session identifier is a unique number 
(ag., an increasing sequential , number). At step 60 so 
server S4 records the session identifier in a session 
table 55. Each server in the server pool preferably has 
an associated session table 55 for storing session iden- 
tifiers of the sessions being, managed by that server 
The redirector likewise includes an appropriate data' ss 
structure 57 maintaining information about which server 
is managing which session. At step 62. the redirector 
returns an appropriate redirection response (e.g.. "302 : 



URL has moved") to the client This response also iden- 
Mies the URL of the server and includes the session 
identifier. e.g.: "httptfserver 4 URL/session identi- 
fier/login.html. This is sometimes referred to herein as a 
Virtual" URL This completes the redirection function. 
[0044] the client then continues at 6tep 64 by issu- 
ing a new HTTP connection request, using the URL 
passed from the managing server during step 6^ This 
redirection process is typically an automatic function 
earned out by the browser in response to the 302 
response. The routine then continues at step 66 with the 
server S4 serving the requested data All data returned 
to the client contains the session identifier as part of the 
URL base. Throughout the remainder of the session, all 
HTTP connection requests from this particular client 
machine are managed through server S4. This is illus- 
trated at step 68 in the flowchart 
[0045] A test is performed repeatedly at step 70 to 
determine whether the session is to be terminated Typ- 
ically, this request is Identified by the redirector because 
the user has taken sortie action that generates a given 
logoff request Thus, for example, the redirector has 
parsed the input HTML stream and recognized the fol- 
lowing request: "httotfserver 4 URUsession identi- 
f ier/logoff.rltjnr. At this point in the routine, namely, at 
step 72. the sewer S4 deletes the session identifier 
from the session taWe 55 and performs the requested 
logoff action. Deletion of the session identifier is some- 
times referred to as Inactivating" the identifier because 
once the session Identifier is removed from the table' 
the client machine is rio longer explicitly coupled to the 
server that, up until that -time, had been handling the 
connection requests originating from that machine dur- 
ing the user session. . 

[0046] The routine then continues at step 74 to 
return a logoff screen to the client machine. The server 
then notifies the redirector that the user associated with 
the session identifier has logged off. At step 76. the redi- 
rector. releases the server S4 from the session thus 
making the server available to handle a new session 
from another user. 

[0047] Thite. the redirector maintains a running 
count of the number of sessions being managed by 
each server In the pool, as well as the identification of 
the actual session identifiers being managed at each 
server. As a user completes his or her session, the 
server that has been managing the session becomes 
available to the redirector (and. thus, the load balancing 
routine) to service another set of connection requests 
from another user. thus, load balancing is achieved on 
a per-session. as opposed to a per-connecfion. basis. 
This provides significant advantages over the prior art 
[0048]: In particular, a given user may now under- 
take a set of transactions arid be assured that all such 
transactions are managed by the same server. By 
implementing the redirection function, the user is 
assured that he or she may carry out a set of transac- 
tions without having to login repeatedly to the server. 
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This Is quite advantageous as the transactions are con- 
nection requests that are communicated in a connec- 
tion-less (i.e. a "stateless7 operating environment By 
aWing the virtual URL to the redirection request the 
target server has the capability to count actual user ses- 
sions and thus impose an upper limit of users instead of 
connections. Another benefit is that if the user cannot 
bookmark a page in the middle of a session because 
the virtual URL is short-lived (i.e. it only lasts as long as 
the session lasts). This ensures that the user is forced 
to back to the beginning URL to begin a new session, 
thus preventing restarts in the middle of a particular 
session. The redirector provides a consistent session 
flow. 

[0049] The above-descrfoed functionafity preferably 
is irrplemented in software running on the managing 
server. There is no requirement to modify the client-side 
software as the redirection to the virtual URL takes 
place automatically. The redirector thus may be plug-in 
enabled I code, preferably a Java servlet The functional- 
ity thus is implemented in software executable in a proc- 
essor, namely, as a set of instructions (program code) in 
a code module resident in the random access memory 
of the computer. Until required- by the corrputer, the set 
of instructions may be stored in cuwfher computer mem- 
ory, for example, in a hard disk drive,, or in a removable 
rriemory such as an optical disk (for eventual use in a 
CO ROM) or floppy disk (for eventual use in a floppy disk 
drive), or downloaded via the Internetor other corrputer 
network. 

[0050] In addition, although the various methods 
described are conveniently irrplemented in a general 
purpose computer selectively activated or reconfigured 
by software, one of ordinary skill in the art would also 
recognize thett such methods may be carried out in 
taMtfware. in firmware, or irt mcr6 €pecianzed apparatus 
constructed to perform the required method steps. 
[0051J Further, as used herein, a Web Went* 
means any corrputer or corrpbnent thereof directly or 
indirectly connected or correctable in any known or 
later-devetoped manner to a computer netwwK such as 
the Internet The term Web "server means a computer, 
corrputer platform, an adjunct to a computer or plat- 
form, or any component thereof, Of course, a "client- 
means one who requests or gets the file, and "server" is 
the entity which downloads the fila 

Claims 



1 * A .nw*wd for managing connection requ^ts to a so 
pool of servers identified by a given URL, compris- 
ing the steps of; 

in response to a connection request from a 
given client machine that initiates a session, ss 
associating a session identifier with a. given 
server in the pool; 
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using the session identifier to redirect the con- 
nection request to the given server; and 

during the session; redirecting to the given 
server any additional connection requests from 
the given client machine. 

The method as claimed in claim 1 wherein the step 
of using the session identifier includes generating a 
virtual URL 

The method as claimed in claim 2 wherein the vir- 
tual URL comprises a URL in the connection 
request modified to include the session identifier. 

The method as claimed in any preceding claim 
wherein the session identifier is incorporated in 
data returned from the given server to the client 
machine. 

The method as claimed in any preceding claim fur- 
ther including the step of: 

in response to a connection request from the 
given client machine that terminates the ses- 
sion inactivating the session identifier. 

The method as claimed in any preceding claim 
wherein the given client machine includes a 
browser. 

The method, as claimed in any preceding claim 
wherein each of the servers in the pool supports a 
similar set of pt^ects. 

The method as claimed in any preceding daim 
wherein the session identifier is associated with a 
given server as a function of a load balancing proto- 
col. 
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A computer program product in a computer-reada- 
ble medium for managing connections requests to a 
pool of servers, corrprising: 

means responsive to connection requests from 
client machines that initiate user sessions for 
associating, each user session originating from 
a client machine with a given server in the pool 
in accordance with a load balancing protocol; 
and 

means operative during each user session for 
redirecting to- the given server any additional 
connection requests originating from the dient 
machine. 

10. A server for managing a pool of servers at a Web 
site identified by a given URL, corrprising: 
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aprocessor; 
an operating system; 
a load balancing routine; and 

a retfrector routine for managing HTTP con- 
nection requests to the Web site, comprising: 

means responsive to connection requests 
from client machines that initiate user ses- 
sions for associating each user session 
originating from a client machine with a 
given server in the pool in accordance with 
the load balancing routine; and 

means operative during each user session 
for redirecting to the given server any addi- 
tional connection requests originating from 
the client machine. 
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